Spoken Inquiry Discrimination Using Bag-of-Words for Speech-Oriented Guidance System

نویسندگان

  • Haruka Majima
  • Rafael Torres
  • Yoko Fujita
  • Hiromichi Kawanami
  • Tomoko Matsui
  • Hiroshi Saruwatari
  • Kiyohiro Shikano
چکیده

We investigate a discrimination method for invalid and valid spoken inquiries, received by a speech-oriented guidance system operating in a real environment. Invalid spoken inquiries include background voices, which are not directly uttered to the system, and nonsense utterances. Such spoken inquiries should be rejected beforehand. By now, we have reported a method using the likelihood values of Gaussian mixture models (GMMs) to discriminate invalid spoken inquiries from valid ones. In this paper, we improve the performance by utilizing not only the likelihood values but also other information in spoken inquiries such as bag-of-words (BOW), utterance duration, and signal-tonoise ratio (SNR). To deal with these multiple information, we use support vector machine (SVM) with radial basis function (RBF) kernel and maximum entropy (ME) method and compare the performance. In the experiments, we achieve 86.6% of Fmeasure for SVM and 84.2% for ME, while F-measure for GMM-based method is 81.7%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System

4. Proposed method Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System Haruka Majima*, Rafael Torres*, Hiromichi Kawanami*, Sunao Hara**, Tomoko Matsui***, Hiroshi Saruwatari*, Kiyohiro Shikano* *Graduate School of Information Science, Nara Institute of Science and Technology, Japan **Graduate School of Natural Science and Technology, Okayama Univer...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Operating A Public Spoken Guidance S

Takemaru-kun system is a practical speech-oriented guidance system developed to examine spoken interface through longterm operation in a public place that collected natural humanmachine interaction data. In 2004 the following advances improving reliability of the system were introduced, which conduced acquiring positive increase of access from users: (1) Rejection of unintended speech based on ...

متن کامل

Augmenting Conversations through Context-Aware Multimedia Retrieval based on Speech Recognition

Future’s environments will be sensitive and responsive to the presence of people to support them carrying out their everyday life activities, tasks and rituals, in an easy and natural way. Such interactive spaces will use the information and communication technologies to bring the computation into the physical world, in order to enhance ordinary activities of their users. This paper describes a...

متن کامل

Visually Grounded Learning of Keyword Prediction from Untranscribed Speech

During language acquisition, infants have the benefit of visual cues to ground spoken language. Robots similarly have access to audio and visual sensors. Recent work has shown that images and spoken captions can be mapped into a meaningful common space, allowing images to be retrieved using speech and vice versa. In this setting of images paired with untranscribed spoken captions, we consider w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012